SfM-Net: Learning of Structure and Motion from Video
Abstract
We propose SfM-Net, a geometry-aware neural network for motion estimation in videos that decomposes frame-to-frame pixel motion in terms of scene and object depth, camera motion, and 3D object rotations and translations. Given a sequence of frames, SfM-Net predicts depth, segmentation, camera and rigid object motions, converts those into a dense frame-to-frame motion field (optical flow), differentiably warps frames in time to match pixels, and back-propagates. The model can be trained with various degrees of supervision: 1) self-supervised by the reprojection photometric error (completely unsupervised), 2) supervised by ego-motion (camera motion), or 3) supervised by depth (e.g., as provided by RGBD sensors). SfM-Net extracts meaningful depth estimates and successfully estimates frame-to-frame camera rotations and translations. It often successfully segments the moving objects in the scene, even though such supervision is never provided.
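The step of converting predicted depth and camera motion into a dense motion field can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes a pinhole camera with hypothetical intrinsics `fx, fy, cx, cy`, covers only the camera-motion part of the flow (no object masks or per-object motions), and assumes that `R` and `t` map 3D points from the first frame's camera coordinates into the second frame's. The function name `rigid_flow` is chosen here for illustration only.

```python
import numpy as np

def rigid_flow(depth, fx, fy, cx, cy, R, t):
    """Optical flow induced by camera motion alone (static scene assumed).

    depth: (H, W) array of per-pixel depth in the first frame.
    R, t:  assumed to map first-frame camera coordinates to second-frame
           camera coordinates (X2 = R @ X1 + t).
    Returns an (H, W, 2) array of (du, dv) pixel displacements.
    """
    h, w = depth.shape
    # Pixel grid: u indexes columns, v indexes rows.
    u, v = np.meshgrid(np.arange(w, dtype=np.float64),
                       np.arange(h, dtype=np.float64))
    # Back-project each pixel to a 3D point using its depth.
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    points = np.stack([x, y, depth], axis=-1)        # (H, W, 3)
    # Apply the rigid camera motion to every point.
    moved = points @ R.T + t
    # Project the moved points back onto the image plane.
    u2 = fx * moved[..., 0] / moved[..., 2] + cx
    v2 = fy * moved[..., 1] / moved[..., 2] + cy
    return np.stack([u2 - u, v2 - v], axis=-1)
```

For a fronto-parallel plane at constant depth and a small sideways translation, every pixel shifts by the same amount, `fx * t_x / depth`, which is a quick sanity check for this kind of code. In a trainable model the same arithmetic would be written in an automatic-differentiation framework so that the photometric error of the warped frame can be back-propagated to the depth and motion predictions.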
Similar resources
Global Structure-from-Motion and Its Application
Structure-from-motion (SfM) is a fundamental problem in 3D computer vision, with the aim of recovering camera poses and 3D scene structure simultaneously given a set of 2D images. SfM methods can be broadly divided into incremental and global methods according to their ways to register cameras. Incremental methods register cameras one by one, while global SfM methods solve all cameras simultane...
Video Subject Inpainting: A Posture-Based Method
Despite recent advances in video inpainting techniques, reconstructing large missing regions of a moving subject while its scale changes remains an elusive goal. In this paper, we have introduced a scale-change invariant method for large missing regions to tackle this problem. Using this framework, first the moving foreground is separated from the background and its scale is equalized. Then, a ...
Ambiguities in Camera Self-Calibration
Structure from motion (SfM) is the problem of computing the 3D scene and camera parameters from a video or collection of images. SfM can be further classified as calibrated and un-calibrated. In calibrated SfM, the internal camera parameters are known. This is a much easier problem than the un-calibrated case, where these parameters are unknown. Solving for the internal camera parameters are kn...
Review: Recent Structure-From-Motion Algorithms 3D Shape Reconstruction
Existing face recognition systems are based on 2D facial images and exhibit well-known deficiencies. Accordingly, the face recognition research is gradually shifting from classical 2D to sophisticated 3D or hybrid 2D/3D. 3D shape reconstruction from multiview photographs and video sequences (2D images) is an active area of research which can fully leverage the potential of existing 2D image acq...
Real-time Structure from Motion for Augmented Reality
Our work is focused on developing a real-time structure from motion (SfM) algorithm that is usable in an augmented reality system. We envisage augmented reality applications involving a head-mounted camera and display system. This requires an SfM algorithm that is robust to different scene structure and camera motion and will invariably have to deal with the problems of occlusion, clutter and m...
Journal: CoRR
Volume: abs/1704.07804
Publication year: 2017